Is Magnitude Estimation Worth the Trouble?

Authors

  • Shin Fukuda
  • Grant Goodall
  • Dan Michel
  • Henry Beecher
Abstract

In sentence acceptability experiments, subjects are typically asked to indicate their response to sentences in one of three ways. In a yes/no forced choice task, subjects simply indicate whether or not the sentence sounds good. In an n-point numerical scale (or Likert scale) task, the extremes of a scale are defined and subjects choose a number on the scale that reflects their overall response to the sentence. Finally, in a magnitude estimation (ME) task, subjects compare experimental sentences to a reference sentence. This reference sentence is associated with a number (or subjects may choose this number on their own). Subjects are instructed to rate the experimental sentences in relation to the number given to the reference sentence. If the experimental sentence sounds twice as good as the reference sentence, for instance, they should double that number; if it sounds half as good, they should divide it in half, etc. Each of these response methods has potential advantages and disadvantages. The yes/no method has the virtue of being very easy for subjects to understand, but it is often thought to be relatively coarse-grained and to require large numbers of subjects to detect fine differences. The n-point numerical scale arguably yields finer-grained results while still being easy for subjects, but there is no guarantee that the chosen scale will allow as many distinctions in acceptability as subjects actually perceive, or that subjects will treat the distance between any two adjacent points on the scale as being the same (e.g. in a 5-point scale, subjects might treat the difference between 1 and 2 as being larger or smaller than that between 3 and 4). ME is clearly not easy for subjects to understand, in that it is an unfamiliar task that requires some mathematical sophistication, but it could reasonably be expected to overcome the two disadvantages seen for the n-point scale. Subjects are able to make as many distinctions as they want, and since they are explicitly asked to make ratio judgments (i.e. how many times better or worse the experimental sentence is compared to the reference sentence), one would expect less uncertainty about the nature of the results. In this study, we submit these three response methods to a critical examination by comparing the results obtained by each in three otherwise identical experiments. In section 2, we review the previous literature on these methods, and in section 3, we present the set of experiments that constitute the core of our contribution, concluding that some of the claimed advantages of ME do not appear to be empirically supported. In section 4, we devote some attention to differences among the three methods that emerged in our results, and in section 5, we explore some other results of interest in our experiments. Section 6 presents conclusions and implications for the working syntactician.
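The ratio arithmetic behind an ME response can be illustrated with a minimal sketch. The function names, the reference value of 100, and the sample ratings below are hypothetical and are not taken from the study; the log transform is included only as one common way to make "twice as good" and "half as good" symmetric, not as the authors' analysis.

```python
import math


def me_ratios(reference_value, ratings):
    """Express each raw ME rating as a ratio to the reference sentence's number."""
    if reference_value <= 0:
        raise ValueError("reference value must be positive")
    return [rating / reference_value for rating in ratings]


def log_ratios(ratios):
    """Log2-transform ratios so doubling and halving are equally far from 0."""
    return [math.log2(ratio) for ratio in ratios]


# Hypothetical data: the reference sentence was assigned 100.
reference = 100
raw_ratings = [200, 50, 100]  # twice as good, half as good, equally good

ratios = me_ratios(reference, raw_ratings)
print(ratios)              # [2.0, 0.5, 1.0]
print(log_ratios(ratios))  # [1.0, -1.0, 0.0]
```

On this view, a yes/no response keeps only whether a sentence is judged good, and an n-point scale keeps an ordered category, whereas the ME response is meant to preserve the ratio itself.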


Similar resources

DASTWAR: a tool for completeness estimation in magnitude-size plane

Today, great observatories around the world devote a substantial amount of observing time to sky surveys. The resulting images are inputs to source finder modules. These modules search for the target objects and provide us with source catalogues. We sought to quantify the ability of detection tools in recovering faint galaxies regularly encountered in deep surveys. Our approach was based on com...


Single station estimation of earthquake early warning parameters by using amplitude envelope curve

In this study, new empirical relationships for estimating key parameters in Earthquake Early Warning (EEW) systems, including magnitude, epicentral distance and Peak Ground Acceleration (PGA), are introduced based on features of the initial portion of the P-wave’s amplitude envelope curve. For this purpose, 226 time series recorded by borehole accelerometers of the Japanese KiK-net are processed for earthq...


Voltage Flicker Parameters Estimation Using Shuffled Frog Leaping Algorithm and Imperialistic Competitive Algorithm

Measurement of the magnitude and frequency of voltage flicker is very important for monitoring and controlling voltage flicker efficiently to improve the network power quality. This paper presents two new methods for measurement of flicker signal parameters using Shuffled Frog Leaping Algorithm (SFLA) and Imperialist Competitive Algorithm (ICA). This paper estimates fundamental voltage and flicker ...


Costs of treatment after renal transplantation: is it worth to pay more?

Objectives: The present study aimed to provide an estimation of the current financial burden of renal transplantation therapy for insurance organisations. Methods: An Excel-based model was developed to determine the treatment costs of current clinical practice in renal transplantation therapy (RTT). Inputs were derived from Ministry of Health and insurance organizations' databases, hospital and p...


Pseudo Zernike Moment-based Multi-frame Super Resolution

The goal of multi-frame Super Resolution (SR) is to fuse multiple Low Resolution (LR) images to produce one High Resolution (HR) image. The major challenge of classic SR approaches is accurate motion estimation between the frames. To handle this challenge, a fuzzy motion estimation method has been proposed that replaces the value of each pixel using the weighted averaging of all its neighboring pixels i...



Publication date: 2011